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Abstract 

We work out a theory of approximate quantum error correction that allows us 
to derive a general lower bound for the entanglement fidelity of a quantum code. 
The lower bound is given in terms of Kraus operators of the quantum noise. This 
result is then used to analyze the average error correcting performance of codes 
that are randomly drawn from unitarily invariant code ensembles. Our results 
confirm that random codes of sufficiently large block size are highly suitable 
for quantum error correction. Moreover, employing a lemma of Bennett, Shor, 
Smolin, and Thapliyal, we prove that random coding attains information rates 
of the regularized coherent information. 



1 Introduction 



Physical processing, transmission, and storage of quantum information unavoidably 
suffers from decohering interactions with the environment. The insight that the 
resulting errors can, in principle, be corrected has been a major breakthrough in the 
field of quantum information theory [H [2]- A theory of quantum error correction 
(QEC) rapidly evolved [31 HJ [5] and eventually led to the concept of quantum fault 
tolerance [Bj, which, in fact, put large-scale quantum computation back in the realms 
of possibility. Quantum error correction stands in close relation to the information 
capacity of a noisy quantum channel and the quantum coding theorem El [9] . 

In this paper we elaborate a theory of approximate QEC. We obtain a general 
and easily computable lower bound for the entanglement fidelity of a noisy channel M 
that is attainable when the information is encoded in a given error correcting code. 
The bound is expressed in terms of Kraus operators of jV, and the projection on the 
code space. (Sec. [3]) 
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We employ this theory to analyze the average error correcting performance of 
codes that are chosen at random from certain code ensembles. For the unitarily 
invariant ensemble of all K-dimensional code spaces we find a surprisingly simple 
lower bound for the averaged code entanglement fidelity. Its deviation from unity 
is determined by y/KN \\ N{ttq) \\f, where N is the number of Kraus operators 
in an operator-sum representation of the noise N under consideration, ttq is the 
homogeneously distributed state of the system Q on which M is operating, and || A \\p 
denotes the Frobenius norm V trA^A of an operator A. We derive this result by 
reverting to random matrix theory. For the special case of unital noise the lower 
bound immediately reveals that randomly chosen codes attain with high probability 
the quantum Hamming bound [3] (Sec. 

Our next issue is the extension of the foregoing considerations to the case of noise 
operations that do not conserve the trace. We find it useful to understand them as the 
result of a selective process and therefore define fidelities and coherent information in 
this situation slightly different from the standard definitions in literature (see, e.g., 
[TO]) (Sec. E]). 

One motivation why we extend our theory to trace-decreasing operations becomes 
apparent in the last section. Here we show that with the aid of a recent lemma of 
Bennett, Shor, Smolin, and Thapliyal (BSST) [111 02] our results allow a relatively 
simple proof of the direct coding theorem. Our proof follows ideas of Shor [13] 
and Lloyd [7] by showing that QEC based on random code spaces attains rates 
of the regularized coherent information. The proof is therefore quite different from 
Devetak's one |14| . which is based on a correspondence of classical private information 
and quantum information (Sec. E|). 

After having clarified some conventions and notations, we will start in Sec. [2] with 
a brief introduction to QEC and quantum channel capacity. The remaining sections 
are organized as laid out above. 

1.1 Conventions and notations 

We denote a general mixed state by p, a general pure state by ip, and add subscripts 
to indicate the system. For instance, i/jqr means a pure state of the joint system of 
Q and it!. 

We will use the trace norm || A \\tr= trv 7 At A, and the Frobenius (Hilbert-Schmidt) 
norm || A \\f= VtrA^A for a linear operator A. The two-state fidelity is here defined 

as F(pi,p 2 ) =\\^pl^P2\\tr- 

2 Quantum error correction and quantum capacity 

Throughout the paper, we consider a quantum system Q that is supposed to store 
or transmit quantum information. We denote the Hilbert space of Q by Hq and its 
finite dimension by M. In addition to a possible internal unitary dynamics, Q is 
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subjected to external noise during storage or transmission. Let the effect of both be 
described by a completely positive, trace-preserving mapping M that maps an initial 
density operator p to a final density operator p' = N(p) [El [16]. We call M either a 
noise operation or, synonymously, a noisy channel. M can be always represented in 
an operator sum 

N 

Af(p) = 1 £AipAj , 

i=i 

where the (non- unique) Kraus operators A\, . . . , An are linear operators on Hq . They 
satisfy the completeness relation YU Ai'A{ = 1q. 

2.1 Quantum error correction 

In general, a QEC scheme for the noise N on Q is based on a quantum error correcting 
code C, which, by definition, is a certain linear subspace C of Hq. Let K be the 
dimension of C, and let P be the projection on C. We call a state p a state in C 
or a code state (of C) if the support of p is a subset of C. If the code C is suitably 
chosen, one may find a recovery operation 1Z that exactly recovers all code state from 
corruption by M, i.e., for all code states p of C, 1Z o M{p) = p. 

Finding an optimal code C for the correction of some given noise N is a difficult 
task. The code C should be of course as large as possible, but at the same time the 
encoding in C must also be sufficiently redundant such that errors caused by M can 
still be identified and corrected. In practice, the code may also satisfy additional 
technical constraints. Somewhat simpler than this problem but nevertheless instruc- 
tive is the following related one: Given the noise operation J\f, what can be gained 
by the use of a certain quantum code C? Here, theory does provide definite answers 
in the form of necessary and sufficient conditions for the feasibility of quantum error 
correction. 

First, there are quite elementary necessary and sufficient conditions for exact QEC 

[311 EKE]: 

Exact recovery of all code states is possible if and only if for all i,j the operators 
PAj^A{P are proportional to P, 

PA/A t P = htrPAjAiP) P . (1) 

K 

For explicitly given Kraus operators A{ it is usually no problem to check these 
conditions. If they are satisfied, it is also possible to explicitly construct the Kraus 
operators for the recovery operation 1Z. Things become more complicated when the 
conditions are violated. In this case, it can become quite difficult to foresee whether 
the violation is serious, and therefore error correction virtually impossible, or whether 
the violation is harmless and code states are still essentially correctable up to some 
small deviations. An early approach to this problem has been given in |17j . 
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An alternative condition for QEC can be formulated in terms of coherent infor- 
mation [18]. The coherent information I(p,M) of a state p with respect to the 
noise M is defined by 

I{p,M) = S{M{p)) - S{1 R ®N{^ RQ )) , 

where S(g) = — tr glog 2 g is the von Neumann entropy, ipRQ is a purification of 
p, and Xr is the identity operation on the ancilla system R. The last term, S(Ir ® 
M(iPrq)), is the entropy exchange S e (p,M) of p with respect to M [19]. The coherent 
information obeys an important inequality [H]: For any two operations £\ and £2 

S{p)>I(p,£ 1 )>I(p,£ 2 o£ 1 ). (2) 

Moreover, equality in the first inequality holds if and only if the action of £\ on p can 
be completely reversed, meaning that there exists an 1Z such that Tr<S>(JZ.o£\){^)rq) = 
iJjrq, for any purification iJjrq of p. This leads to the following necessary and sufficient 
condition for error correction [18J: 

Exact recovery of all code states is possible if and only if for a state pc with 
supp (p c ) = C 

S{ Pc ) = I{pcM)- (3) 

Schumacher and Westmoreland [20] have shown that this condition is robust 
against small perturbations, i.e., if it is only approximately satisfied, then errors 
can still be approximately corrected. Their central result is a lower bound for the 
entanglement fidelity [19] of an arbitrary state p under the noise M and a subsequent 
recovery operation 1Z. It is proven that for given p and J\f there exists an 1Z such 
that 

F e {p,KoN) > l-2^S(p)-I(p,Af). (4) 

To elaborate on this, let us discuss entanglement fidelity and its relevance for our 
purposes. 

2.2 Entanglement fidelity 

The entanglement fidelity F e (p,£) of a state p under an operation £ on Q is defined 
by 

F e (p,£) := (Tp R Q \1r®£(iPrq)\'<Prq) , 

where tpRQ i s an Y purification of p. That this is independent of the chosen pu- 
rification can be seen from the representation in terms of Kraus operators of A/ - , 
F e (p,£) = J2iL\ \trpAi\ 2 [TI5]. Especially interesting is the entanglement fidelity of 
the homogeneously distributed code state ire = P/K. The reason is that F e (irc,£) 
is a lower bound of the code-averaged channel fidelity F av (C,£) (Appendix lA.il cf. 
|2H 122]). Moreover, it can be shown that when F e (jrc,£) is close to unity, C must 
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have a large subcode CcC with a similar high minimum fidelity F m i n (C,£) (Ap- 
pendix IA,2p . The entanglement fidelity F e (irc,£) is therefore a convenient figure of 
merit that characterizes the distortion of states in C under the operation £ . 

In order to capture the suitability of a code C for QEC without referring to a 
certain recovery operation we introduce 

F e (C, Af) := maxF e (ir c ,KoN) , (5) 

the entanglement fidelity of the code C under noise Af. By relation (j4j) it is then clear 
that 

F e {C,M) > 1 - 2yJS(ir c ) ~ I{*cM) • (6) 

This shows that for small S{nc) — I(^CiAf) <C 1 the code entanglement fidelity is 
close to unity and thus approximate QEC is possible. 

Building on ideas of Schumacher and Westmoreland's proof of relation 0, here 
we will derive an alternative lower bound for the code entanglement fidelity F e (C, Af) 
that is explicitly given in terms of the Kraus operators of Af (cf. relation ([9]) in Sec. [3]). 
However, before we start, let us briefly point out that the code entanglement fidelity 
([5]) can also be used to conveniently define quantum capacity of a noisy channel. 

2.3 Quantum capacity of a noisy channel 

We consider the following scheme of information transmission from Alice (sender) to 
Bob (receiver) by means of the channel Af |8j: Alice is allowed to encode quantum 
information in blocks of n identical copies of Q, with the block size n and the encoding 
operation £ n at her disposal. Sending the block to Bob, each individual system Q 
is independently disturbed by the noise operation Af, i.e., the whole block Q n is 
subjected to Af® n . Bob is allowed to perform any decoding operation 1Z n in order to 
restore the message which Alice originally sent. The maximum amount of quantum 
information, measured in units of qubits, that can be reliably transmitted per channel 
use in such a scheme defines the quantum capacity Q(Af) of the noisy channel Af [8]. 

Precise mathematical definitions of the quantum capacity can be given in many 
ways [23]. Here we use one that fits in the present context of approximate QEC and 
the code entanglement fidelity. 

It has been shown that restricting the encoding operation £ n to isometric embed- 
dings into Hq U has no effect on the capacity [9]. £ n is thus sufficiently described by 
the subspace C n of Hq n whose code states represent the encoded information. View- 
ing C n as an error correcting code, Bob is able to reconstruct Alice's message within 
a precision that is given by the code entanglement fidelity F e (C n ,Af® n ). We follow 
the standard definitions and call R an achievable rate of Af if there is a sequence of 
code spaces C n C Hq n , n = 1, 2, . . ., such that 

l im sup lQ g2dimg w ^ ^ F mj^) = 1 . (7) 

n— »oo n n— >oo 
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The quantum capacity Q(N) is the supremum of all achievable rates R of N. 

The quantum coding theorem for noisy channels [8] [9] states that the quantum 
capacity Q(N) of a channel N equals the regularized coherent information 

UN) = lim -maxI{p,N 9n ) ■ (8) 

n^oo n P 

I r {N) has long been known an upper bound for Q(N), which is the content of the 
converse coding theorem [8] [9]. The direct coding theorem, stating that I r (N) is 
actually attainable, has been strictly proven only recently by Devetak [14]. His proof 
utilizes the correspondence of private classical information and quantum information. 
More direct proofs in the spirit of Shannon's ideas on random coding [24] have been 
earlier outlined by Shor [13] and Lloyd [TJ. In the last section, we will employ our 
theory to provide a strict proof along these lines. 



3 Lower bound for the code entanglement fidelity 

In this section we derive a lower bound for the code entanglement fidelity F e (C,N) in 
terms of Kraus operators A\, . . . , An ofN and the projection P on the /f-dimensional 
code C. We will show that 

F e (C,N) > 1- \\D\\tr , (9) 

where 




is an operator on He, with He being an ancilla Hilbert-space spanned by or- 
thonormal vectors |1), . . . , \N). 

The coefficients of D precisely correspond to the conditions ([1]) for exact error 
correction. If these are fulfilled the operator D vanishes and inequality ^ also 
predicts perfect error correction. In this sense, the lower bound ([9]) can be considered 
as a generalization of the elementary conditions (pQ) to the case of approximate QEC. It 
is worth mentioning that the lower bound does not depend on the chosen set of Kraus 
operators Ai,..., Ajy for N. Equivalent sets are related by a unitary transformation 
[16J which in Eq. (fl~0|) amounts merely to a unitary basis change, and therefore leaves 
|| D || i r invariant. 

To prove relation © we describe N as a unitary Uqe on Q and an environment 
E, followed by a partial trace over E [15], [16] . That is, for a general state pq 

N{pq) = tr B Uqe pq® U qe ] , 

where ipE is some fixed initial state of E. Further, let ipRQ be a purification of pq, 
let pr = trg tpRQ, and let a pure state 4>'rqe on RQE be defined by 

^'rqe = (1r ® u qe) ^rq ® ^e (1/? ® U QE ] ) ■ 
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iP'hqe purifies its partial states 

Pq = to RE i>' RQE , p' E = tr RQ ip' RQE , 
Pre = tr Q i{j RQE , p' RQ = ti - E ^' RQE . (11) 

Following ideas that has been utilized in [20] and |25j we show that there exists a 
recovery operation 1Z on Q such that 

F e {p Q ,TZoN) > 1- Wp're- PR®P E \\tr • (12) 

The idea is to find in the vicinity of the actual final state ip' R Q E (or an extension tp' of 
it) a state ip from which i[)rq can be perfectly recovered by an operation 1Z on Q. The 
distance between ip' and ip will then determine a lower bound for the entanglement 
fidelity F e {p Q ,KoM). 

To this end, we consider the product state pr <g> p' E with its obvious purification 

4> ■= ^RQ ® ^RQE 

on the joint system RQSE, where S denotes a copy of RQ. We extend ^'rq E to a 
pure state ip' on RQSE by some pure state tps of S (i.e. tracing out S or RQE yields 
iP'rq E or -05, respectively). According to Uhlmann's theorem [29j [30J [16] there is a 
unitary Uqs on QS such that 

MUqs^')? = F(p R ®p' E ,p' RE ). (13) 

Then, for a recovery operation 1Z on Q defined by 

we find 

ZR ® Wrq) = tr 5B C/qs ^ CW , 

which by the monotonicity of the fidelity under partial trace [16] and i/jrq = trgg i/> 
yields 

F e (p Q ,lZoM) = F(ip RQ ,I R ®K(p' RQ ) ) > |$| U QS iP')\ 2 • 

With Eq. (fT3"|) and the general relation F(p,a) > 1— || p — <r ||t r [16] this proves 
relation ([T2]) . 

Now, we become more specific and chose for given Kraus operators A\, . . . , An of 
A/" its representing unitary Uqe such that 

iV 

U QE \iPQ}\l)='£A i \i> Q )\i), (14) 

8=1 
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where |1) = \4>e}, |2), • • • , \N) are orthonormal vectors in He- Further, let pq = ttq 
P/K with purification 



-T^EiW). ( 15 ) 

,=i 



where the orthonormal vectors |cf ), . . . |c^} and ), . . . \c^) span Hr and C, re- 
spectively. For this setting, we obtain 

-. N K 

PRE = ^E E tr Q (^| C f)(c«|A/)|cf)(c^|®H}01, (16) 

ij =1 i,m=l 

AT 

Pi? ® /4 = E trg^vrcA/) p fl ® \i)(j\ . (17) 

Things become more convenient if we isometrically map both states with an isometry 
defined by 

J ■ E «y,/ml C f)( C ml ® l*>01 ^ E «*i,Zm|cft(c£| ® 
ij,lm ij,lm 

to 

1 * 

* := ^he) = ^ E PAUjP ® |*>01 

y=l 
1 * 1 

r := J(pr®Pe) = ^E^ tr ( P ^% P ) p ®K)0'l- 

»i=i 

Hence, || — Pr® p'e \\tr=\\ X — Y \\ tr , which with relation (fT2j) leads us to 

F e (ir c ,K°M) > 1- ||X-F|| tr . 

Since the left-hand side is a lower bound of the code entanglement fidelity F e (C, A/ - ) , 
and X — Y = D, this finally proves relation Q. 



4 Random quantum codes 

Random codes play an important role in classical as well as in quantum information 
theory. In this section we will analyze the average error correcting performance of 
random codes by means of the lower bound © for the entanglement fidelity of the 
codes. We consider the same setting as before: a quantum information storing system 
Q with M-dimensional Hilbert space Hq that is exposed to noise A/" with a set of 
Kraus operators A\, ... A n- 
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4.1 Ensemble averaged code fidelity 

Let Ek be an ensemble of iT-dimensional codes in Hq with an ensemble average [A] 
defined for code dependent variables A = A(C). We are interested in the ensemble 
averaged code entanglement fidelity [F e (C, M)]. By inequality Q, 

[F e (C,Ar)]>l-[\\D\\ tr ] , (18) 

where D is the code dependent operator Eq. ([10]) , 

In many cases, averaging the trace norm of D would be quite a difficult under- 
taking. We therefore prefer to estimate [||-D||tr] by the more convenient average of 
the squared Frobenius norm, [|| D |||,] = [tvD^D]: Trace norm and Frobenius norm of 
D with domain C (g> He of dimension d = KN satisfy 

||-D||tr< v 7 ^ ||D|| F . 

We remark that this inequality is a good estimate only if the eigenvalues of D are 
of similar magnitude. Using this estimate and employing Jensen's inequality [26] we 
obtain 

<y/d[\\D\\l], (19) 
and so 

[F e (C,M)] > 1 - y/KN[\\D\\ 2 F ] . (20) 

In the next subsection we will evaluate this lower bound for unitarily invariant code 
ensembles. 



\D\ 



tr 



< 



Vd[\\D\ 



Vd 



D 



|2 
\F 



4.2 Unitarily invariant code ensembles 

Let Uk be the unitarily invariant code ensemble that consists of all .fT-dimensional 
codes in Hq, furnished with the unitarily invariant ensemble average 

[A(C)] Uk := I dfi(U) A(UC ) , 

JU(H Q ) 

where Co C Hq is some fixed code space of dimension K, and \x is the (normalized) 
Haar measure on JJ (Hq), the group of all unitaries on Hq. Later on we will also 
consider an analogously defined ensemble Uk(V) that consists of if -dimensional codes 
in some subspace V of Hq. 

Our task is to calculate [|| D \\ F ]u K - By the explicit representation Eq. ([10]) of 
operator D we immediately find 

1 N 1 
|| D ||| = tr D*D = ^2 E tv(PWi/PWij) - -|trP^-| 2 , 

ij=l 
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where the operators Wij are 



Wi. 



The ensemble average of || D \\p can be conveniently calculated if we introduce a 
Hermitian form 



b(V,W) :-- 



tr(PViPW) - — tr(Pyt) t T (PW) 
K 



u K 



such that 



\D\\l 



1 



(21) 



(22) 



We recall that P is the projection on the ET-dimensional code space that is chosen 
with unitarily invariant probability from the ensemble Uk- By Eq. (I2ip it is therefore 
clear that b(V, W) is a unitarian invariant on Hq, i.e., for any U G XJ(Hq) 

b(UVU\ UWU ] ) = b(V, W) . 

This places us in a position to utilize the general theory of group invariants by Weyl 
\17\ I28j: In the present situation it means that b(V, W) must be a linear combination 
of the two fundamental unitarily-invariant Hermitian forms tr V'W and tr V^tr W, 



b(V, W) = a tr V*W + P tr V+tr W . 



(23) 



To determine the coefficients a and (3 we derive two linear independent equations by 
equating Eqs. (|2ip and (|23[) for two special choices of the operators V and W. For 
V = W = 1h we obtain as a first equation, 



a M + P M 2 = 



(24) 



Next, we set V = W = P\, where Pi is the projection on an one-dimensional space 
spanned by some unit vector \tp) £ Hq. From Eq. pip we immediately find 



b(Pi,Pi 



1 



1 

K 



P 



u K 



Reverting to results from random matrix theory, we obtain in Appendix[B] [| (t/j\P\i/j) | 2 ] 
[K 2 + K)/(M 2 + M) (which for large K and M is close to the naive estimate 

|p|^i 2 l rr „ « K^W)]L = ^ 2 /^" 2 )- Thus, 



1 



M 2 + M ' 

With b(P\,P\) = a + (3 from Eq. (|23|) this yields the second equation, 



a + (3 



K 2 



1 



M 2 + M ' 



(25) 
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Solving Eq.s (|24p and (|25p for a and /3, and inserting the solution into Eq. ([23 
produces 



b(V, W) 



1 



and, by Eq. 



I nil 2 

I u \\F 



M 2 — 1 



1 - l/K 



tr V*W - — tr W tr W 
M 



(26) 



In general, not much is given away if instead of this exact result we use an upper 
bound for [H^Hl]^ that we obtain by using (1 - l/K 2 )/(M 2 - 1) < 1/M 2 and by 
omitting the negative terms — |tr W{j\ 2 /M in the sum. Then 

M 2 



U K 



< _l y: tr wjwq = E 



M 



where we cyclically permuted operators under the trace to obtain the last equality. 
We realize that the argument of the trace is simply M(7tq) 2 , with 7Tq = 1q/M being 
the homogeneously distributed density operator on Hq. This yields the rather simple 
upper bound 



l^lll 



u K 



<W*q)\ 



By relation (|2Up this means 

[F e (C, N)}u K > 1 - VKN II AA(ttq) \\ f 



(27) 



(28) 



Before discussing this result let us generalize it to the unitarily invariant ensemble 
Uk(V) of isT-dimensional codes in a subspace V C Hq (dim V > K). Here the average 
is given by 



[A(C)] 



U K (V) 



U(V) 



dfjL V (U) A(UC Q ) , 



where \xy is the normalized Haar measure on the group U(V) of unitaries on the 
subspace V. Up to the fact that now the role of Hq is taken over by the linear space 
V nothing has changed compared to the situation before. Hence, the derivation given 
above for the ensemble Uk applies to the ensemble Uk{V) as well, showing that 



oi^iilW) <m*v)\\ 2 F , 

and consequently, 

[F e (C, N)]u K (v) > 1 - ^KN \\f 
where 7ry = Hy/ dimV. 



(29) 



(30) 



11 



4.3 Discussion 



It is instructive to discuss the just obtained lower bounds for the case of unital 
noise, which by definition leaves the homogeneously distributed state ttq invariant, 
M(ttq) = ttq. A unital operation is for instance the process where arbitrary unitary 
operations U\, . . . ,Un are applied to the system Q with probabilities pi, ■ ■ ■ ,pn- For 
unital noise \\M(ttq) \\p=\\ 7Tq \\p= t r ( 7r o) = Hence, by the lower bound ([28]) . 



[F e (C,Af)}u K > 1- 




This means that on almost all codes C of the ensemble Uk the unital noise Af can 
be almost perfectly corrected, provided that 

KN <M. 

Recalling that K is the code dimension, N is the number of Kraus operators in an 
operator-sum representation of Af, and M is the dimension of Hq, we recover that 
randomly chosen codes attain the quantum Hamming bound [3]. 

The requirement K <C M/N suggests that log 2 M — log 2 N is a lower bound of 
the capacity Q(Af), what we will now formally derive. To this end, we consider the 
ra-fold replicated noise Af® n , and study the averaged entanglement fidelity of the 
code ensemble Ujc n , where we chose the code dimension to be K n = \ 2 nR \ for some 
positive R. J\f® n operates on states in Hq 11 and has N n operation elements. With 
M also N® n is unital, thus \\N® n (pQ n ) f F = M~ n , and by Eq. 

In the limit n — > oo the right hand side converges to unity if R < log 2 M — log 2 N. 
Since limn^oo i log 2 K n = R this implies that all rates below log 2 M — log 2 N are 
achievable and so, by the definition of quantum capacity in 12.31 

Q{M) > log 2 M - log 2 N . 

We note that since J\f is unital log 2 M = S(itq) = S (Af (ttq)) . Now, if we could 
identify the second term, log 2 A r , with the entropy exchange S e (iTQ,Af) we would 
obtain that the lower bound log 2 M— log 2 N is just the coherent information I(ttq,M), 
in accordance to the capacity formula. However, this is the case only for a special kind 
of unital operations. M must have a Kraus representation with operation elements 
A\, . . . , An such that tvAj^Ai = for i ^ j, and jjtrA^Ai = const. = 1/N. Then 
by Schumacher's relation indeed 

S e (7T Q ,Af) = S ({ixA i 'K Q A j %^ l) ... N ) = S(1 N /N) = log 2 iV. 
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The first condition is actually no restriction, since a nondiagonal representation 
Si,..., ^ with txB^Bj / can always be unitarily transformed to a diagonal 
one (cf. footnote [1]) . The second condition demands that, roughly speaking, different 
kinds of errors appear with equal probability. In the end, this ensures that by the 
estimation || D ||t r < y/KN \\D\\p not much is lost and therefore the lower bound (I28D 
is good. 

To recapitulate, for unital noise M the lower bounds for the ensemble averaged 
code fidelities immediately make evident that the quantum Hamming bound is at- 
tainable by random codes. Moreover, if the noise M satisfies the condition of equally 
probable errors as specified above we can establish 

Q M >I(k q ,N). (31) 



5 Error correction in selective noise 

The hitherto presented analysis is restricted to trace-preserving noise operations. 
Here we will extend the considerations of the preceding sections to the case of trace- 
decreasing noise, which we find to be convenient in later use. First, we define channel 
fidelity and entanglement fidelity for a trace-decreasing channel. Within this defini- 
tions we will then generalize the lower bound ([9]) and the result (I30p on the ensemble 
averaged code fidelity. 



5.1 Fidelities for trace-decreasing channels 

For a (possibly) trace-decreasing operation M on a system Q we define the channel 
fidelity with respect to a state pq as 

F ch { PQ ,M) ■= trN( PQ ) F(p Q , ^g|y) , (32) 

where F(p, a) is the usual two-state fidelity. The definition deviates from the standard 
one by a factor tiM (pq). This makes sense, when one interprets a trace-decreasing 
J\f as a selective operation that selects individual elements of the initial ensemble pq 
with probability tiM (pq) [15j . Consequently, in order that F c h{pQ,M) is close to 
unity not only the selected final state M{pq) /tvM{po) must be close to pq, but also 
the selection probability must be close to unity. 

We define the entanglement fidelity of M with respect to pq as 

F e {p Q ,M) ■= F ch (^ RQ ,l R ®AT) = {^ R q\{Ir®J^){^rq)\iPrq) , (33) 

where i^rq purifies pq . Note that if M is trace-decreasing also its extension Ir (g) M 
is trace-decreasing, in which case F c h means the just defined fidelity (f32j) . Repeating 
the arguments of Schumacher [19J , it is not difficult to see that also the entanglement 
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fidelity of a trace-decreasing M can be expressed by its Kraus operators A\, . . . , 
of M by the usual formula 

N 

F e (PQ,N) = \ tr PQ A i\ 2 ■ (34) 

i=l 

A simple but important consequence of this relation is the following: Let for a subset 
N C {1, . . . , N} a quantum operation M be defined by 

Mpq) ■= E A iPQ A ^ > 
which we will call a reduction of the operation Af. Then by Eq. (|34p , 

F e (p,Af) > F e (p,N) . 

Further, since for any operation 1Z on Q clearly 7£ o is a reduction of 7£ o M, we 
conclude that for any code C 

F e {C,N) > F e {C,M) , (35) 

where the code entanglement fidelity for a trace-decreasing N is defined as for trace- 
preserving noise by F e {C,N) := max^ F e (irc> H -A/")- 

5.2 Lower bound for code entanglement fidelity 

Let N be a noise operation on Q that can be represented by Kraus operators 
A\, . . . ,Apf. The entanglement fidelity of a i^-dimensional code C satisfies 

F e (C, AT) > trAA(vrc) - ||D||tr , (36) 

where ttc = P/ K is the homogeneously distributed code state, and the operator D 
is defined exactly as in Eq. (fTUj) . 

This relation generalizes the lower bound Q to the case of a trace-decreasing 
operation ]\[. Its proof given in Appendix [C] is almost identical to the one of Eq. Q 
in Sec. El 

5.3 Unitarily invariant code ensembles 

We consider the ensemble Uk(V) of all i<C-dimensional codes in a subspace V of Hq 
which we introduced in Sec. 14.21 According to the lower bound (f36j) . the averaged 
code entanglement fidelity under a (possibly trace-decreasing) noise Af with Kraus 
operators A\, . . . , An satisfies 

[Fe{C,M)\ 

U K (V) - [ tv N(nc)]u K (V) [ll- D llt'"]u Jf (y) , 
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where D is given by Eq. (jlOp , As shown in Sec. HI 



[\\D\\tr] UKiy) < ^ kn W\ d \\ 2 f\ Uk(V ) ^ y/KN WMWf , 

where 7iy = IL//dimV (cf. Eq.s (|19p and (|29p ). Furthermore, we will show below 
that 

[tlM(7Tc)] UKiV ) = trAA(TTy) , (37) 

and thus obtain 

[F e (C,Af)] UK{v) > trAA(vry) - ^KN \\ M{v V ) \\f ■ (38) 

We show Eq. (|37p by again referring to unitarian invariants: Let a linear form a 
on the set of all linear operators on Hq be defined by 

W~a(W) :=±[trPWP] UK{v) , 

where, as always, P = He- Since the codes C are subspaces of V it is clear that 
a(W) = aiJlyWYly). Further, the unitarian invariance of the code ensemble entails 
a(W) = a(UWU^) for all unitary transformations U on Hq with U(V) = V. It 
follows that a must be proportional to the fundamental invariant linear form on V, 
W h-> tr(IIyPF). From a(ILv) = 1 we can then deduce that a(W) = tr(7ryW). To 
conclude the proof of Eq. (|37p we note that 

N j N N 

[trAf(7rc)W(y) = ^^Mi-PA^^cv) = ^a(AiUi) = tr^A^vA 1 = trAA(^). 

6 Lower bounds for the quantum capacity 

In this section we will prove that the quantum capacity Q(N) of a general trace- 
preserving channel M satisfies 

Q(M)>I(tt v ,M), (39) 

where Try is the homogeneously distributed density on an arbitrary subspace V of the 
system's Hilbert space Hq. We will then use the lemma of BSST in order to establish 
the regularized coherent information I r (M) (cf. dH)) as a lower bound of Q(N). 

We first prove inequality (j39l) for the case V = Hq or ixy = txq. A strategy of proof 
becomes evident when we look back at Sec. 14.31 where we showed Q(M) > I{tvq,M) 
under the conditions of 

(i) equally probable errors, and 

(ii) unitality: M(ttq) = ttq. 
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For general noise M these two requirements are certainly not fulfilled, not even ap- 
proximately. However, since our concern is the channel capacity of M we are free to 
consider the n-times replicated channel J\f® n . For large n it is possible to arrange for 
the conditions (i) and (ii) in an approximate sense by, as it will turn out, only minor 
modifications of the operation M® n . Following Shor [T3], we 

(a) reduce the operation ]\[® n to an operation J\f n that consists only of the typical 

Kraus operators of M n (cf- Sec. 16,1.2]) . 

Thereafter we 

(b) project on the typical subspace T n of A/"(7tq) in Hq U (cf. Sec. 16. 1.3p . 

The purpose of reduction (a) is to approximately establishes a situation of equally 
probable errors (i). The second step allows to restrict the output Hilbert space of M n 
to the typical subspace T n , on which the density J^ n (^Q n ) is approximately homoge- 
neously distributed. This establishes a situation similar to (ii). After having proven 
Eq. (I39j) for V = Hq in Sec. 16.2.11 we will argue in Sec. 16.2.21 that its generalization 
is trivially obtained by restricting the original input Hilbert space Hq of AA to a 
subspace V C Hq. Finally, in Sec. 16.2.31 we use the lemma of BSST in order to show 
that Q{N) > I r {N). 

6.1 Reduction of the noise J\f® n 

Both, typical Kraus operators and typical subspaces are defined on the basis of typical 
sequences (see, e.g., [IE]). We briefly recall their definition and state two basic facts 
that are important for our purposes. 

6.1.1 Typical sequences 

Let Xi, X2, . . . , X n be a sequence of independent random variables that assume values 
A%, . . . , An with probabilities pi, P2, ■ ■ ■ ,Pn- We denote the probability distribution 
by A. Its Shannon entropy is H (A) = — Yl%=i Pi 1°§2 Pi ■ Let e be some positive 
number. A sequence A = Aj ± , Aj 2 , . . . , Aj n is defined to be e- typical if its probability 
of appearance pa = PnPh ■ ■ ■ Pj n satisfies 

2 -n{H(A)+e) < pA < 2 -n(H(A)-e) _ 

Below we will make use of the following two facts: 

1. the number of all e typical sequences A £j „ is less than 2 n ^ Hi - A ' >+e \ 

2. the probability P £ ^ n that a random sequence of length n is e-typical satisfies 
1 - Pe,n < 2e~ n ^ , where ip(e) is a positive number independent of n. 

Proofs can be found in Appendix [Pi 
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6.1.2 Restriction to typical Kraus operators 

Let a trace-preserving noise M on Q be represented by Kraus operators A\ , . . . , An ■ 
Without loss of generality we can assume that the Ai are diagonal in the sense that 
tr A ^Ai = for i ^ j. We define the probability pi of the Kraus operator Ai as 

Pl := jjtrAfAi , (40) 

and we denote the corresponding probability distribution by Aj^. The definition 
makes sense, because the pi are positive and, as a consequence of the trace preserva- 
tion of M, sum up to unity. 

The n-times replicated noise ]\[® n can be represented by N n Kraus operators 

A n ®A h ®...® A jn = Ai , 

where j u = 1,...,N and j = (ji, j'2, • • • , j n )- By the diagonality of the operators Ai 
of J\f also the operators A^ of M® n are diagonal, and the probability pj of the element 
A^ appears to be the product of the probabilities pj v of its constituent elements Aj v , 

Pi = iik trAjtylj = i tr ^ ty1 ^ • • • taiAjjAjn) = Pjl ■ --Pin ■ 

In other words, the Kraus operators Aj of J\[® n are sequences of length n in which 
symbols Ai of an alphabet A\, . . . ,Ajy appear according to the distribution Aj\f. 
Hence we are in the domain of classical random sequences and can employ the notions 
of Sec. 16.1.11 to define the e-typical operation N £ . n of N® n by 

p ^ KM '■= a ip a j, 

Aj e-typical 

i.e., A/" e ,n consists only of the e-typical Kraus operators of N . In general, this strongly 
reduces the number of Kraus operators from N n to 

(cf. Sec. 16.1. 1| property 1.). It is time to remark that H(A_\f) is nothing other than 
the entropy exchange S e (TTQ,J\f), such that the last relation becomes 

N E>n < 2 n{s ^' Af)+£) . (41) 

To see this, we notice that H(A^f) equals the von Neumann entropy of an N- 
dimensional diagonal density matrix W with elements W„ = itrvlj ^ Ai. Since 

1 For arbitrary operation elements Si, . . . , Bn of M let an N x N matrix H be defined by 
since H = H\ there is a unitary matrix U such that UHU^ is diagonal. Because 
of the unitary freedom in the operator-sum representation |16] . the operators A m := y\ U^ m Bj 
equivalently represent Af. It is readily verified that tr^4;' A m = for I ^ m. 
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we are working in a diagonal operator-sum representation, this actually means that 
Wij = jjtrAfAi = t?AiiTQAj\ where ttq = 1q/M. By Schumacher's representation 
of the entropy exchange we thus realize that H(A_\f) = S e (^KQ,J\f). 

Despite its strongly reduced number of Kraus operators, in average the operation 
N Et n does not much reduce the trace when n becomes large. This can be seen by 
the selection probability trA4,n(7r<5„) of the homogeneously distributed state HQ n = 
lg n /M n . A lower bound can be derived by observing that 

trA4,„(vr Q J = ^ tr Ai4j t= E Pi 

Aj £-typical Aj e-typical 

is the probability that an operation element A^ of J\f® n is e-typical 
16.1.11 property 2., 

trA4,n(7r Q J > l-2e-^( £ ), 
where ipi(e) is a positive number independent of n. 

6.1.3 Projection on typical subspace 

We will further reduce the operation J\[® n by letting Me,n follow a projection on the 
e-typical subspace T £i „ C Hg n of the density M(ttq). The benefit of this procedure is 

that the so obtained operation 7V £ ,n maps TTQ n to an almost homogeneously distributed 
state on T £) „, and thus establishes a situation similar to (ii) in Sec. ([5]). 

The e-typical subspace T £)n C Hq U of a = N{ttq) is spanned by the e-typical 
eigenvectors of cr® n [TB]. These are precisely the eigenvectors v\ with eigenvalues p\ 
satisfying 

2 -n(5(A^(7r Q ))+ £ ) < pi < 2 -™(S(A%Q))-e) . 

The dimension of T £) „ obeys 

dimT £jn < 2 "( s ( A/ '(^))+ e ) . (43) 

If n is large, almost the entire weight of <7® n lies in the e-typical subspace: Let n 
be the projection on T e>n , then 

trn £) „a®"= Yl PI' 

I : \vi) e-typical 

which in the notions of Sec. l6.1.i1 is the probability that an eigenvalue \ v\) = |%)|fz 2 ) • • • 
is e-typical. Thus, by the second property in Sec. 16.1-H 

trn £iri cr 0n > i-2e- n ^ £) , (44) 

where V^^) is a positive number independent of n. 



. Thus, by Sec. 

(42) 
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We define the e-reduced operation of M® n by 
where the operation V £jTl describes the projective measurement on T e>n , 

v £ , n ■■ p !->■ n £) „ p n £n , 

and M £ ,n is the e-typical operation of M® n as defined in the previous subsection. 
6.1.4 Properties of the e-reduced operation jv e>n 

The e-reduced operation N e , n can be represented by Kraus operators of the form 
n £;n ^4j, where A- s is an e-typical operation element of N® n . Their total number N e n 
is therefore bounded by 

N £iU = N £>n < 2< S ^Q< N ^ . 

Besides the number of Kraus operators, the two other crucial figures are trA/" £)n (7rQ n ) 
and || Me,n(^Q n ) \\f (cf- relation ([55])). In Appendix[E]we derive the followings bounds: 

trK tn (n Qn ) > l-4e-^( £ ), 

where ^(e) is a positive number independent of n. Finally, we note that for any 
codeCc#| n 

F e (C, N® n ) > F e {C,Ne,n) > F e (C,M £ , n ) . 

The first inequality holds because Af £>n is a reduction of N® n and the second one is 
explained by the fact that M £>n results from post-processing of A4,n by which 
cannot increase the code entanglement fidelity (cf. Eq. (JSJ). 

6.2 Lower bounds for Q(M) 

Lower bounds of the quantum capacity Q(N) are given by the achievable rates of M. 
Finding out whether a rate R is achievable or not requires to investigate the code 
entanglement fidelities F e {C n ,N® n ) for suitable codes C n C H® n (cf. Sec. EH}. Our 
working hypothesis is that no special care has to be taken in choosing C n . Rather, 
we suppose that randomly chosen codes in general do provide high achievable rates 
and therefore will study the averaged entanglement fidelity of the code ensembles 
introduced in 14.21 
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6.2.1 Q{N) > I{itq,N) 

We begin with the average code fidelity [F e (C n , A/"® n )]c/ K of the unitarily invariant 
ensemble Uk„- As in 14.31 we chose the code dimension to be 

K n = [2 nR \ , 

meaning that R = lim n ^ 00 — log 2 K n is the asymptotic rate. By relation (I38p and the 
results of the previous subsection we immediately find 

[F e {C,N® n )]u Kn > [F e (C,M s>n )]u Kn >l-a n -(3 n (45) 

with coefficients 

a n = 1 - tr A4 , n (7r Q J < 4e~ n ^\ 

0n = y/K^\\&A*Qn)\\F < 2f(* + ^QX)-S(A/> Q )) + 4 £ ). 

Clearly, for all e > 0, the right-hand side of inequality (|45p converges to unity in the 
limit n — > oo if the asymptotic rate R obeys 

R + Ae < S(M(tt q )) - S e (7r Q ,M) = I(tt q ,M) . 

That is, all rates R below I(ttq,N) are achievable and therefore I(ttq,M) is a lower 
bound of the capacity Q(J\f). 

6.2.2 Q{N) > /(Try, AO 

Let V be an arbitrary linear subspace of the system's Hilbert space Hq, and let 
iry = Hy / dimV. In short, the coherent information I(jrv,AT) can be established as 
a lower bound of Q{J\f) in exactly the same way as before I{jtQ,N) if we consider 
instead of M the operation C that is defined as the restriction of N to states py on a 
reduced input Hilbert space V C Hq. For the sake of completeness, we briefly repeat 
the arguments. 

This starts with reducing C® n to an e— typical C £tn as described in Sec. 16.1.21 : 
The reduced input Hilbert space V of C entails that now the probability pi of a Kraus 
operator A{ has to be defined as 

Pi = jtTU v AMiU v , (46) 

where L = dimV, and Hy is the projection on V. Here it is assumed that the 
operators Ai,...,A^ are diagonal with respect to V, i.e. trn^ylj ^A,-Hy = for 
i j. Accordingly, the probability of a Aj = Aj x (8) ... (8) Aj n is 

Pj^trH^UjHf = Pjl ...p jn . 
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As before, £ £) „ is defined to consist only of the e-typical Ay Its number L £j „ is 
bounded by 2 n ( H+£ ^ , with H being the Shannon entropy of the normalized probabil- 
ity distribution (|46p . Therefore, H conincides with the von Neumann entropy of a 
diagonal density matrix W with entries 

Wij = -tr U v AMjU v = tr Aj—A- 1 . 

By Schumacher's representation of the entropy exchange we obtain H = S e (irv,M), 
where 7iy = ily/L. 

The next step is to further reduce C e ^ n to an operation £ £i „ by projecting the 
output of C e>n on the typical subspace T £i „ C Hq U of the density C(nv) = N(ttv)- 
This follows precisely Sec. 16.1.31 with a = N{ttq) replaced by a = N^v)- The 
resulting C e<n is characterized by (cf. Sec. I6.1.4jl 

j <s 9 n(S e (7r v , M)+e) 

^£,1% — & ) 

tr£ e ,n > l-4e~ n ^ (£ \ 

H Z i|2 < 9 -n(S(N(7T V ))-3e) 
|| ■ i -£,n \\F — L 

F e (C,£® n ) > F e (C,£ £ , n ), 

where L £:Tl is the number of Kraus operators that is needed to represent C e>n . Thus, 
by inequality ([38]) . 



[F e {C,C® n )] UKn (ysiu- ) > l — a n — (3 n , 

where the coefficients a n and j3 n are as in the previous subsection, but with ttq 
replaced by ny. Since further [F e (C, ■N'® n )\u K {V® n ) = [Fe(C, C® n )}u Kn (y<gm-) we can 
thus conclude that all rates R below 

S(Af(Tr v )) - S e (Tr v ,Af) = I(nv,Af) 

are achievable by ftf, meaning that Q[N) > I(irv,N). 

6.2.3 Q(Af) > / r (A0 

Finally, we will show that with the BSST lemma the result of the last subsection 
implies the lower bound 

Q(AT) > —l{p,M® rn ) , 

m 

where m is an arbitrary large integer, and p any density on Hq™ 1 . Clearly, this suffices 
to prove the regularized coherent information I r (M) (cf. Sec. 12.31 ) a lower bound of 
Q{M). 

The BSST lemma [11] states that for a channel M and an arbitrary state p on 
the input space of M 

lim lim -S(A^(7r £ ,„)) = S(N(p)) , 
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where n e n is the homogeneously distributed state on the frequency-typical subspace 

r» < 

tion, 



(f) ( 
Te,n of p. As a corollary, one obtains an analogous relation for the coherent informa- 



lim lim -I(n E , n ,M® n ) = HpM) ■ 

( f) 

Te,n is similar to the ordinary typical subspace T £j „ which we have used above. The 

( f) 

difference is that for T £jn typicality of a sequence is defined via the relative frequency 
of symbols in this sequence, whereas for T £i „ it is defined by its total probability. For 
details we refer the reader to the work of Holevo [12] , where an elegant proof of the 
BSST lemma is given. 

Here, what matters is solely the fact that 7r £) „ is a homogeneously distributed 
subspace density of the kind that we used in the previous subsection. Thus we can 
make use of the bound Q(£) > I{ny,£) with, for instance, £ = jV®" 1 ™, and V being 
the frequency- typical subspace T^n C iJ® mn of an arbitrary density p on Hq" 1 . This 
means that for any e > and any m, n 

Q(M® mn ) > I{Ti £ ^M® mn ) ■ 
Using the trivial identity Q(N® k ) = kQ(M) we can therefore write 

Q{M) = — lim -Q{M® mn ) 

771 n^oo n 

> — lim lim -J(7r e n , (AA® m )® n ) 

TO e-^On— *oo n ' 

= -I(p,AT® m ), 

TO 

where the last equation follows from the corollary. 



7 Concluding remarks 

We expect that the lower bound ([9]) for the code entanglement fidelity is also useful for 
directly evaluating the error correcting capability of a particular code for a particular 
noise operation. In this case, there is no need to estimate the trace norm of the 
operator D by its Frobenius norm. The only reason why we used this in general 
rather poor estimate here is that it enabled us to perform the ensemble average. 

The above proof of the direct coding theorem shows that a randomly chosen 
code of sufficiently large block-size is typically a good quantum error correcting code. 
Studying the properties of unitarily-invariant code ensembles might be therefore al- 
ways a good thing to do when general aspects of QEC are of concern. 

Note added. We would like to mention the recent eprint of Hayden et al. [31], in 
which a similar proof of the direct coding theorem has been independently obtained. 
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A Fidelity relations 

A.l F e (7T C ,S)<F av (C,S) 

The average fidelity of the code C with respect to noise £ is defined as 

Fan 

(C,£) = dp c {U) F ch {U^U\£) , 
JU(C) 

where F c h(p,£) = F(p,£(p)), ipo is an arbitrary pure state in C, and pc is the 
normalized Haar measure on the group U(C) of unitaries on the code space C. For 
a complete ensemble t/ji, . . . , ipx of orthogonal pure states in C, K = dimC, we find 

1 K 

F av (C,£) = / dp c (U)-Y, F ch{U^iU\£) 

> f dp c (U)F e (±Y,Ui>itf,£) =F e (ir c ,£). 
The inequality follows from the general relation [TB] 

Y^PiF ch ( Pi ,£) > F e (^ PiPu £^j . (47) 

A. 2 Subcodes with high minimum fidelity 

Let C be a code of dimension K with entanglement fidelity 

F e (TT C ,£) = 1-e. 

We will show that there is a subcode C of C of dimension K = \_K/2\ with minimum 
fidelity 

Ffnin 

(C,£) := min F ch (if),£) >l — 2e . 
\ij>)ec 

To this end, we recursively define a sequence of subspaces Co D C% D . . . D Ck-i, 
and a corresponding sequence of code vectors \ipo), \ipi), • • • , \ipK—i) as follows: 



i = : C 
i > : d 



c 

vector of minimal fidelity in Cq 
vector of minimal fidelity in 
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By construction, dimC = K — i, and F m i n (Ci,£) = F(ipi, £(ipi)) = Fj. It is also 
clear that the minimum vectors |V>o)j l^i); • • • \1pK-1) form an orthonormal basis of C. 
Hence ttq = Y^q 1 anci , by relation (flTj) . 

1 - 



K i=o 



For any < t < K we therefore obtain 



1 1 *=! K-t t 

j=0 i=K—t 

where the last inequality follows from 1 > Fq > Fx > . . . > Fk-x > and 

1 - e < 1 F K -t 

~ K K 

is equivalent to 

K 

l-—e< F K -t , 

meaning that subspace Ck-i of dimension t has minimum fidelity larger than 1—eK/t. 
Setting t = [K/2\ completes the proof. 

B Average of |(^|P|?/;}| 2 

We show that independent of the normalized vector £ Hq 

mpm\ 2 ]u K = ^^ (48) 

(notations as in Sec. 14. 2p . By definition, 

[\{Mpm 2 \u K = J dm muPorfm 2 , 

where the integral extends over XJ(Hq) and Po is the projection on an arbitrarily 
chosen linear subspace Co C Hq of dimension K. We extend \tp) = \ipx) to an 
orthonormal basis \tpi) , ... ,\^Pm) of Hq, and chose 

C := span{|^i),.. . , \i^k)} ■ 

Then 

\m\ 2 h K = E / d^U)\Uu\ 2 \U l3 \\ 
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where Uij = (ipi\U\ipj) . Making use of the unitary invariance of p, this becomes 

K J dp(U) \Un\ A + (K 2 -K) J dn(U) \U U \ 2 \U 12 \ 2 . 

For the calculation of these integrals we refer to the work of Pereyra and Mello [33J , 
in which, amongst others, the joint probability density for the elements Uu, . . . , U\k 
of a random unitary matrix U G Uk has been determined to be 

/ k vn-fc-l k 

p(U n , ...,U lk ) = c(l-J2 \Uia\ 2 @(1 - E l^l 2 ) > 

V a=l J a=l 

where c is a normalization constant, and 0(x) denotes the standard unit step function. 
By a straightforward calculation, we obtain from this 



M- • M ' 
1 



dn(U) |C/n| 4 : 

Jdmw^wn? - M2+M , 

which immediately leads to Eq. (|48p . 



C Lower bound for code entanglement fidelity 

Without loss of generality we can describe a possibly trace-decreasing M as a unitary 
operation Uqe on QE which is followed by a projective measurement on E that may 
reduce the trace. That is, for a general state pq 

N(pq) = tr E (1q <8> Pw)Uqe Pq <8> ^£ U QE ] , 

where ipE is a fixed initial pure state of E, and projects on some subspace of He- 
Let again ipRQ be a purification of pq, pr = tiQipRQ, and let a normalized pure state 
ip'jlQE on RQE be defined by its state vector 

Wrqe) = ® iV)(lfl 8) C/qb) 8) , 

VP 

where p = trAA(pg). The state 4>'rqe is purification of its properly normalized partial 
states p' Q , p' E , Prqi and p' RE . Note that JV(pq) = Pp' RE - 

Precisely as in Sec. ([3]) it follows that there exists a recovery operation 1Z on Q 
satisfying 

F(ip RQ ,l R ® ft(/4 Q ) ) > 1- || p' RE -pr® p' E \\tr ■ 

By definition fj33[) of entanglement fidelity for trace-decreasing operations this imme- 
diately leads to 



F e (p Q ,KoJ\f) > p - \\pp' R E-PPR® Pe\ 



tr 
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which generalizes relation (|12p . 

Continuing in a similar manner as before in Sec. [3j we consider pq = ttq with the 
purification (fT5|) . and chose the unitary Uqe with projection Pyy such that 

N 

(i Q ® p w )c/ QB ^ Q )|i) = X>hfo>l») > ( 49 ) 

1=1 

where |1) = \iI>e), |2), . . . , \N) are again orthonormal vectors in He- Then, it is readily 
verified that 

j N K 

PPRE = ^ E E tr Q(^h Q )< c mlV) l c f)(Cml ® K)0'l , 
ij=l l,m=l 

N 

V PR® Pe = E ^Q^^c^/) Pfi 8) , 

ij=l 

where p = trM(irc), which precisely correspond to expressions (fTBj) . (fT7|) . As in 
Sec. ([3J) we conclude that 

F e (ir c ,TloN) > p - \\D\\tr , 
showing that trM(TTc) — \\ D \\t r is indeed a lower bound of F e (C,N). 

D Typical sequences 

The first property follows from 

1 = E^a> E PA > iV e> „2-« . 
A A e-typical 

To prove the second property we first realize that by definition 

P E>n = Pr( a Aj 1} . . . , A jn is e-typical" ) = Pr(|- log 2 ( Pil . .. Pjn ) - nH{A)\ < ne) 

n 

= Pr(|£(-log aPil -.ff(.4))| < ne). 
1=1 

The negative logarithms of the probabilities pj l can be understood as n independent 
random variables Y\ that assume values — log 2 pi, . . . , — log 2 pjv with probabilities 
Pi, . . . ,pjv- Their mean is the Shannon entropy H(A), 

N 

a = E(Yi) = -^Pilog 2Pi = H(A) . 
i=i 

This means that 

n 

1-P e , n = Pi(\J2( Y i ~ A*) I > ne) 
i=i 
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is the probability of a large deviation oc n. Since the variance a and all higher 
moments of Y\ — fj, are finite we can employ a result from the theory of large deviations 
[32] . according to which 

n 

Pr(|^(^- M )| > ne) < 2e~ n ^ , 
1=1 

where tp(e) is a positive number that is approximately e 2 /2a 2 . 

E Bounds for tiJ\f £ ^(7r Qn ) and \\tf e ,n{nQ n ) \\f 

It is convenient to introduce the complementary operation M. e ^ n of N E n by 

M m =M £ , n + M £ , n . 

The operation elements of M e ,n are exactly the e- "untypical" operation elements of 
N® n . Then, 

trAT E , n (7r Q J = trn £in (Ar^(7r Q J-^ £ , n (7r Q J) 

> trn £in AA^(vr Q J - tr M £ , n ^ Qn ) . (50) 

The inequality results from the fact that for two positive operators A, B always 
tvAB > 0, and therefore (indices suppressed) 

tvM{p) = trllM(p) +tr (l-U)M{p) > trUM(p) . 

The first term in Eq. (|5U|) can be bounded from below as 

tr n Ejn AT®> Q J = tr n e , n A^ n (<f ) = tr ^^.(^(vtq))^ > 1 - 2e~ n ^ , 

where we used inequality (|44p . The second term in Eq. (|50p obeys 

trM e>n (*Q«) = trAT^(7r Q J - trM £ , n (n Qn ) < 2e~ n ^\ 

by inequality ([32]). We thus find 

tr A4, n (vr Q J > 1 - 2(e- n ^ 2 ^ + e ~™M £ )) > 1 - 4 e"™^ , 

when il>3(e) := min{^i(e), ^(e)}- For large n the homogeneously distributed state 
7TQ n is almost certainly selected by the reduced operation M e%n . 

Now, let us address the Frobenius norm of N{iTQ n ) ■ For positive operators A,B 

\\A + B\\ 2 F = \\A\\ 2 F + \\B\\ 2 F +2tr AB >\\A\\ 2 F + \\B\\ 2 F . 

This can be used to derive 

\\V s>n oM m (7r Qn ) |||=|| V £ ,n o {K,n + M e , n )(n Qn ) \\ 2 F >\\ V e ,n ° K,n(^Q n ) III • 
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Thus 

|2 _ NT) „ \r \ 112 



|-A4,n(T0„)|lF = WPe,n°Ne,n(KQ n )\\ F 



[2 



< [|7> e ,„o^> 0n )b 

\2 



|n £in (Ar(7r Q ))^n £ » 2 



1 : e-typical 
< 2 -n(S(Ar( 7 rg))-3 £ ) ^ 

where we used ()43|) and pi < 2~ Tl (' s W( 7r Q)) _£ ) to derive the last inequality. 
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